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ABSTRACT 

Theoretical and numerical modeling of the assembly of dark-matter halos predicts that the most 
massive and luminous galaxies at high redshift are surrounded by overdensities of fainter companions. 
We test this prediction with Hubble Space Telescope observations acquired by our Brightest of Reion- 
izing Galaxies (BoRG) survey, which identified four very bright z ~ 8 candidates as Yogs-dropout 
sources in four of the 23 non-contiguous WFC3 fields observed. We extend here the search for Yogs- 
dropouts to fainter luminosities (M* galaxies with Mab ~ — 20), with detections at ^ 5a confidence 
(compared to the 8<r confidence threshold adopted earlier) identifying 17 new candidates. We demon- 
strate that there is a correlation between number counts of faint and bright Yogs-dropouts at ^ 99.84% 
confidence. Field BoRG58, which contains the best bright z ~ 8 candidate (Mab = —21.3), has the 
most significant overdensity of faint Yogs-dropouts. Four new sources are located within 70" (corre- 
sponding to 3.1 comoving Mpc at z = 8) from the previously known brighter z ~ 8 candidate. The 
overdensity of Yogs-dropouts in this field has a physical origin to very high confidence (p > 99.975%), 
independent of completeness and contamination rate of the Yogs-dropout selection. We modeled the 
overdensity by means of cosmological simulations and estimate that the principal dark matter halo has 
mass Mh « (4 — 7) x 10 11 M Q (~ 5a density peak) and is surrounded by several Mh ~ 10 11 M halos 
which could host the fainter dropouts. In this scenario, we predict that all halos will eventually merge 
into a Mh > 2 x 10 14 M© galaxy cluster by z ~ 0. Follow-up observations with ground and space 
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based telescopes are required to secure the z ~ 8 nature of the overdensity, discover new members, 
and measure their precise redshift. 

Subject headings: galaxies: evolution — galaxies: high-redshift 



1. INTRODUCTION 

A growing number of galaxies at z > 7 is being dis- 
covered with Wide Fi eld Camera 3 (WFC3) on the Hub- 



ble Sp a ce Telescope (lOesch et al.l l2010t iBouwens et al.l 
2010at iMcLure et all l2010t 



Finkelstein et al.l I2010F 



Wilki nsetalJ 120101: lYan"eTaLl 120111: ITrenti et all [2011)1 



These galaxies are identified from deep imaging in mul- 
tiple bands, showing the signature of absorption of pho- 
tons with wavelength shorter than Lyma n-q (Lyq) at 
121.6 nm by intervening neutral hydrogen ( jSteidel et al.l 
1996). Based on the wavelength range of the filter in 
which the photometric break is observed, an approxi- 
mate redshift is derived. The Lyman-break technique 
has been validated extensively wit h spectroscopic follow- 
ups of large samples at z < 7 (iMalhotra et al.l 120051 : 



IStark et alJ2010t lOno et alJ20lH iPentericci et alJ2011h . 
Non-ambiguous redshift measurements for galaxies at 
z > 7.2 are missing despite several attempts, possibly 
because of an increase in the neu tral fraction of hy- 
drogen compared to lower redshift (jLehnert et alJl2"010T ; 
iSchenker et aT][20ll . 

HST observations and their ground-based spectro- 
scopic follow-ups are providing a glimpse of the formation 
of stars and galaxies during the epoch of hydrogen reion- 
ization at z > 7, showing a declining star formation rate 
as the redshift incre ases and a rapidly ev olving galaxy 
luminosity function (jBouwens et al.ll2011f ). The evolu- 
tion of the luminosity function is consistent with the 
expected evolution of the dark-matter halo mass func- 
tion: at higher redshift, massive dark-matter halos be- 
come progressively ra rer and the typical mass evolves 
towa r d smaller values (iTrenti et al]|2010t iFinlator et al.l 
1201 li iJaacks et al.l 1201 11 1. It is thus not surprising that 
the majority of z > 7 galaxies, identified from pencil- 
beam surveys such as the HUDF, are faint, with magni- 
tudes mji25 > 27. 

To search for the brightest and rarest z ~ 8 galax- 
ies, we are carrying out a large area medium-deep HST 
survey along random lines of sight ([Trenti et al.l 1201 il 
hereafter Til). Bright galaxies, such as those iden- 
tified by our Brightest of Reionizing Galaxies (BoRG) 
survey, are expected to be highly clustered and to be 
found preferentially in groups rather than in isolation. 
This scenario has been verified observationally out to 
z < 6.5 by s everal groups, both in the context of 
Lyq emitters (IShimasaku et all 120031 : IVenemans et al.l 
12004 lOuchi et al.l 120051) and of Lyman-b r eak galaxies 
(lOverzier et all l2006at IZheng et al.l 120061: ICooke et al.1 
120081: lUtsumi et all 120101 : iCaoak et al.l 1201 laft . Cluster- 
ing of sources around a rare, bright galaxy extends to 
less luminous companions, under a broad assumption 
that galaxy luminosity is correlated with the mass of 

* BASED ON OBSERVATIONS MADE WITH THE NASA/ESA 
HUBBLE SPACE TELESCOPE, WHICH IS OPERATED BY 
THE ASSOCIATION OF UNIVERSITIES FOR RESEARCH 
IN ASTRONOMY, INC., UNDER NASA CONTRACT NAS 5- 
26555. THESE OBSERVATIONS ARE ASSOCIATED WITH 

PROGRAM 11700. 

Electronic address: trenti@colorado.edu 



the dark-matter host halo. This provides an indirect 
test to investigate whether the bright galaxy is at high 
redshift rather than a foreground contaminant that en- 
tered into_Jhe_Jjyjnan : hreak photometric selection win- 
dow ([Munoz fc Loebl[2008al) . 

In Til, we identified four z ~ 8 candidates with 
25.5 < toji25 < 26.7 and detections at > 8a confi- 
dence from an area of approximately 130 arcmin . In 
this paper we revisit the search for z ~ 8 candidates 
in the BoRG survey, extending the selection to fainter 
sources, detected at ^ 5a confidence. Our goal is to test 
whether the brightest z ~ 8 candidates are preferentially 
associated with overdensities of fainter dropouts, thereby 
providing support for the z ~ 8 nature of the br i ght can - 
didate based on the findings bv I Munoz fc Loebl (l2008ah . 
Our particular focus is on field BoRG58, which contains 
the z ~ 8 galaxy with the highest signal-to-noise ratio 
(S/N) found in the survey so far, with J-band magni- 
tude mji25 = 25.8 ± 0.2 (Til). The photometric quality 
of the data available for the field allow us to extend the 
search for fainter dropout sources with TOJ125 < 27.0. 

This paper is organized as follows. In Section [5] we 
summarize the data reduction and analysis, and in Sec- 
tion [3] we investigate the clustering of the newly identi- 
fied logs-dropouts. In Section 3] we discuss the proper- 
ties of the Y"o98-dropouts in field BoRG58 and perform 
extensive tests to evaluate the likelihood that we are ob- 
serving a 3D, physical overdensity in that field. Section[5] 
compares our observations to theoretical/numerical mod- 
els. Section [6] summarizes our findings and concludes 
by discussing the outlook for follow-up observations to 
strengthen the result s obtained in this wo rk. We adopt 
WMAP5 cosmol ogy (iKomatsu et al.ll2009f) and the AB 
magnitude scale (|Okdll974f T 



2. DATA REDUCTION AND DROPOUT 
SELECTION 

The BoRG survey is a four-band (F606W, F098M, 
F125W, F160W) pure-parallel HST/WFC3 survey de- 
signed primarily to identify bright galaxies (mjy^ < 27) 
at z > 7.5 based on their broad-band color s using 
the Lyman-break technique (jSteidel et al.l fl996[) . Data 
are associated primarily with program GO-11700 (PI 
Tre nti), complemen ted by program GO- 11 702 (PI Yan; 
see lYan et al .1 [2~0TlT ) . In this paper we consider the data 
from GO-11700 (23 out of 29 WFC3 pure-parallel point- 
ings) to guarantee a uniform design of the observations, 
as described in Til. Exposure times vary from point- 
ing to pointing because of the pure-parallel nature of the 
survey and are reported, along with the 5a limiting mag- 
nitudes in Table 1 of Til. For reference, we report here 
the exposure times and limits for field BoRG58, which 
contains the best z ~ 8 candidate identified in Til. 
That field, centered at RA = 219°. 230, Dccl = +50°. 719 
(J2000 system), was observed with the following expo- 
sure times (also indicated is the 5a sensitivity within 
an aperture of radius r = 0".32). F125W: t = 2509 s 
(m Hm = 27.0); F160W: t = 1806 s (m Hm = 26.6); 
F098M: t = 4912 s {m Hm = 27.1); F606W: t = 2754 s 
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{miim = 26.8). 

The observations are designed so that at least two dif- 
ferent exposures per filter were taken in each field, but 
there is no dithering, as the spectroscopic primary ob- 
servations do not use it. Individual exposures have been 
calibrated first with calwfc3, then aligned and registered 
on a common 0.08 arcsec /pixel scale using multidrizzle 
(|Koekemoer et al.ll2002l) . 

We identified sources fro m the Ji 2 s images using SEx- 
tractor in dual image mode (|Bertin fc Arnoutslll996l) , re- 
quiring at least 9 contiguous pixels with S/N > 0.7 af- 
ter normalization of the RMS maps as discussed in Til. 
Sources are accepted in our final catalog if they pass a 
minimum signal-to- noise (S/N) threshold with S/N > 5 
in J125 and S/N > 2.5 in H 16Q for ISOMAG fluxes. For 
Gaussian noise, uncorrelated in the two bands, there is 
a probability p < 6.5 x 10~ 9 that a noise fluctuation sat- 
isfies our source-selection criteria. This illustrates that 
the likelihood of introducing spurious sources within this 
catalog is very low, as demonstrated quantitatively in 
Section |U 

2.1. Lyman-break selection and contamination 

To select z ~ 8 candidates, we impose the following 
conditions on the sources in the catalog: 

m Y ms - rnj 125 > 1.75; (1) 

mji2 5 ~ m H i 6Q < 0.02 + 0.15 x (my 98 - rn J125 - 1.75), 

(2) 

and we finally require non-detection in Vqog (S/N < 2). 
Flux measurements are correc ted for foreground Ga lactic 
extinction using the maps bv lSchleeel etaTl (fl998h . For 
the determination of the myo98 ~™Ji25 color, we replace 
the flux in Yqqs with its la upper limit if the measure- 
ment is below that limit. This is the stand ard practice for 
the L yman-break galaxy selection (e.g., iBouwens et al.l 
2007). A different selection technique, based on the fit 
of the observed fluxes in all bands available to construct 
a photometric redshift probabili ty distribution, may re- 
sort to 2cr upper limits (e.g., see lMcLure et al1l2010l ). In 
this respect, we note that our la flux-replacement for 
the Y-J color would be equivalent to a 2a replacement 
while at the same time requiring a slightly less prominent 
break, that is myo98 — ™Ji25 > 1-5. Such selection is still 
m ore conservative than m ypgs — m.ji25 > 1-25 adopted 
by IBouwens et al.l (|2010al) , which nevertheless resorts to 
la limits. 

We measure the near-IR colors based on isophotal 
photometry without Point Spread Function (PSF) cor- 
rections, an approach that optimizes the signal-to-noise 
ratio, as determined by extensive Monte-Carlo simu - 
lations of artificial source recovery (lOesch et alJl2007h . 
Thi s approach has been adopted in sev eral other stud- 
ies (|Oesch et al.ll2009HOesch et al.ll2010t Til). Alterna- 
tive appro aches include resort ing to either fixed aper- 
tures (see iBunker et al.l l2010fl or to K ron-line ellipti- 
cal apertures fsee IBouwens et al.1 12007| ) . The variety 
of approaches present in the literature suggests that a 
gold standard is missing, primarily because results are 
comparable among all different methods, as shown by 
iFinkelstein et all (|2010f ). In addition, we verified directly 
that the impact of PSF corrections on the colors of the 
Yogs-dropouts in the BoRG survey is small. For this pur- 



pose, we corrected the isophotal photometry based on 
the WFC3 PSF model provided by STScI 1 , interpolat- 
ing the data provided over wavelength and radius. For 
the radius, we defined an effective aperture radius for 
each dropout based on the isophotal area used for the 
photometry. Faint dropouts are typically detected over 
an average of 20 pixels, that is an effective radius of 
r c s ~ 0".2. This leads to small corrections (compared 
to the random error on the colors): Ay 98-./i25 = 0.11 
and Aji2 5 -ffi60 = 0.14. 

Our color-color selection includes objects in the red- 
shift range 7.5 < z < 8.5 and is efficient at rejecting 
low-red shift interlopers and b rown-dwarf stars (Figure [TJ 
see also IBouwens et aI1l2010aD . The most likely contami- 
nants in the color selection are 1.5 < z < 2 galaxies with 
either a prominent Balmer break or with strong emission 
lines. In Til, we estimated a contamination fraction 
< 30% based on population synthesis modeling, which 
includes templates with emission lines, and comparison 
with the GOODS/ERS data for the bright dropouts dis- 
cussed there. 

Recently, lAtek et all (|201 If ) extrapolated data from 
a spectroscopic survey shallower than BoRG to derive 
a contamination rate from low-z galaxies with strong 
emission lines in the BoRG survey of one bright Yo98 _ 
dropout every 17 fields. They suggested that galaxies 
with a very faint continuum and flux in the J and H 
band given primarily by line emission (O II, A = 3727A; 
O III, A = 5007 A, and Ha, A = 6563 A) can en- 
ter our photometric selection window. For a galaxy 
with mji25 = 26, this implies a line flux <fi ~ 8 x 
10 -17 erg cm -2 s . Observationally, we have no evi- 
dence that this is the case for the bright Y~o98-dropout 
in field BoRG 58, which has been fo llowed-up with Keck 
spectroscopy. ISchenker et al.l (|2011l) observed that bright 
z ~ 8 galaxy candidate with NIRSPEC, covering the 
wavelength range 0.95 fim < A < 1.29 and reaching 
a median 10cr sensitivity of 2.2 x 10~ 17 erg cm -2 s _1 in 
the Nl filter and 4.3 x 10~ 17 erg cm" 2 s" 1 in the N2 fil- 
ter. No emission lines were detected. If the candidate 
had been an emis sion-line contaminant as suggested by 
lAtek et all ()2011l ). the line would have been clearly de- 
tected (at > 20a confidence) , unless the contaminant has 
an O III line located in a very narrow region of the spec- 
trum with AA ~ 0.01 /im around A = 1.30 /im. All other 
configurations are ruled out. In fact, if the flux in J is 
due to O III (2.0 < z < 2.7), O III moves redward of 
the Hiqq filter for O II at A > 1.27 /im. If instead the 
J125 flux is due to O III (1.2 < z < 1.8), then Ha moves 
out of Hiqq for O III at A > 1.30 /mi. Spectroscopic 
follow-up is therefore efficient at identifying this class of 
contaminants . In the case of the B oRG58 field, the ob- 
servations of ISchenker et al.l (|2011f ) essentially rule out 
that the brightest logs-dropout is an emission-line con- 
taminant. 

Because we are focusing here on fainter sources, we re- 
visited our estimate of contamination. For the fainter 
dropouts near the selection limit of our survey, we esti- 
mate that contamination can reach ~ 50%. This is c om- 
parable to that reported by IBouwens et all (|2010al ) for 
ERS >098-dropouts (w here there is an expected ~ 38% 
contamination; see also lMcLure et al]|2011l : ICapak et all 

1 http : //www. stsci . edu/hst/wf c3/ir_ee_model_smov . dat 
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2011b). Our data arc less deep in the optical bands, but 
we impose more stringent near-IR color-color cuts, which 
reduce the impact of photometric scatter at the expense 
of reduced completeness of our faint sample. 

Other sources of possible contaminants are galac- 
tic ultra-cool star s, which have red Y — J colors 
(jKnapp et all 12004) . However, the use of F098M as a 
dropout filter is efficient at rejecting this class of con- 
taminants when combined with our selection criteria 
^098 — ^125 > 1-7. As shown in Fig. [TJ the locus of 
brown dwarfs is well se parated from our selection win- 
dow for lo98-dropouts. iRvan et"aL1 (|2011| ) searched for 
brown dwarf stars in a data-set that included all the 
BoRG fields considered here and identified a total of 17 
ultra-cool stars in 230 arcmin 2 . All their candidates have 
Yqq S — J 12 5 < 1.4, so it is unlikely that any faint and un- 
resolved source with Y 09S — ^125 > 1.7 has stellar origin. 

3. CORRELATION BETWEEN FAINT AND 
BRIGHT Y098-DROPOUTS IN THE BORG 
SURVEY 

The z ~ 8 Lyman-break selection for faint sources dis- 
cussed in Section |2~T1 has been applied to the 23 BoRG 
fields from program GO- 11 700, with the goal of inves- 
tigating whether the BoRG survey contains evidence of 
clustering of fainter Yogs-dropouts around the brighter 
sources identified in Til. Sources have been inspected 
visually to ensure that they are not artifacts such as 
hot pixels, diffraction spikes of stars, or outer regions 
of brighter foreground galaxies. 

A total of 21 sources satisfy the Yogs-dropout selec- 
tion (Table [TJ). Therefore, we identify 17 new candidates 
(defined as "faint" sample) in addition to the four pub- 
lished in Til (defined as "bright" sample). The "bright" 
versus "faint" classification is based on a S/N selection 
in the J125 band, with bright meaning S/N > 8. The 
number counts per field of the 21 candidates are dis- 
tributed as follows. Twelve fields have no sources satis- 
fying our selection criteria, and six fields contain a single 
faint candidate. The remaining five fields have multiple 
candidates. Four of these five fields are those that con- 
tain one of the bright candidates published in Til. In 
BoRGlk and B0RG66 we find an additional Y-dropout. 
BoRGOt has two faint dropouts in addition to the bright 
one we already reported, while four additional candidates 
have been discovered in field BoRG58. Finally, three 
candidates are present in field BoRGOy, including one 
with mji25 = 26.6 that barely misses the S/N cutoff 
adopted in Til (J125 detection at 7.9a confidence, -Hi6o 
at 4.5er). Table [TJ summarizes our findings. The dis- 
tribution of the number counts is presented in Figure [2] 
and compared to the expectation from the null hypoth- 
esis (no clustering), which is a Poisson distribution with 
average value (N) = 0.913. This comparison has been 
suggested as a method to infer the clustering properties 
of dro pout samples in pure-parallel surveys iRobertsonl 
(2010). Visual inspection immediately identifies the ex- 
cess at N = 5 (field BoRG58), which will be investigated 
in detail in the subsequent sections of the paper. The 
remaining of the number-counts distribution appears to 
deviate slightly from Poisson, but because of the small 
number of fields and dropouts per field, a marked differ- 
ence is neither expected nor statistically significant. At 
low number counts, Poisson uncertainty dominates the 



contr ibution to the standard deviation of th e distribu- 
tion (|Trenti fc Stiavellill2008t lRobertsonll2"010t ). 

To analyze the clustering of dropouts, there is however 
fundamental information that is not conveyed in Fig- 
ure [2] All four fields with a bright z ~ 8 candidate from 
Til contain at least a second, fainter, Yogs-dropout. To 
investigate the chance of this happening, we resorted to a 
Monte-Carlo simulation to derive the probability that a 
random distribution of the 17 new sources among the 23 
fields analyzed leads to eight or more faint sources in the 
four fields with a brighter candidate, with a minimum 
of one source for each field. We carried out 10 5 ran- 
dom realizations and derived that such occurrence has 
probability p — 0.16%. Therefore, the association be- 
tween the newly discovered dropouts and the brighter 
previously reported sources is significant at greater than 
99.84% confidence. The presence of multiple dropouts in 
field BoRGOy qualitatively strengthens the confidence of 
the result, as the brightest dropout in that field barely 
misses the S/N cutoff of Til and therefore our definition 
of "bright" dropout. 

Our choice to define "bright" and "faint" dropouts 
based on a S/N measurement rather than absolute lumi- 
nosity minimizes the impact of differences in the expo- 
sure times among the BoRG fields. In fact, the luminos- 
ity function of Yogs-dropouts around the detection limit 
of the BoRG survey, (mu m ) — 26.7, can be locally ap- 
proximated by a power law, so that the relative expected 
abundance of bright and faint sources is approximately 
uniform across the survey. In addition, we verified that 
the correlation in the number counts between the faint 
and bright sample of dropouts is not arising because the 
fields containing bright candidates have deeper integra- 
tion times. The integration time of the fields containing a 
dropout identified in Til is representative for the survey: 
of the four, two have below median depths and two are 
above median. In particular, field BoRG58, containing 
the most significant excess of dropouts, has an exposure 
time t = 2500 s in F125W, only slightly deeper than the 
median exposure of the survey (t = 2200 s in F125W). 

Foreground Galactic reddening is not the cause of the 
observed correlation between faint and bright dropouts. 
From the lSchlegel et al.l ()1998j ) maps, all fields with z ~ 8 
candidates have low extinction values (Ab < 0.24). The 
average extinction for the sample of the four fields with 
bright candidates is (A B ) = 0.07±0.04 (and BoRG58 has 
Ab = 0.06), while the average extinction for the fields 
containing only faint candidates is (Ab) — 0.17 ±0.07. 
Such low extinction values imply that the reddening cor- 
rections we apply to our near-IR photometry are very 
small (Am < 0.05). 

Finally, we note that the analysis presented here on 
the correlation between bright and faint dropouts is com- 
pletely independent of contamination and completeness 
of the selection, as it is uniquely based on number counts. 
Both contamination and incompleteness actually con- 
tribute toward reducing the significance of clustering. In 
fact, if the sample of dropouts consists of a population 
of bona-fide z ~ 8 objects plus some contaminants at 
1 < z < 2, no angular cross-correlation is present be- 
tween the two populations. In such case, the clustering 
signal is less strong than for the case in which the whole 
sample is in the same redshift interval. Incompleteness 
also reduces clustering. For a fixed number of objects, 
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an incomplete sample contains objects at fainter lumi- 
nosities compared to a complete sample. Because galaxy 
luminosity correlates with hal o mass, this reduces the av- 
erage bias of the population ()Trenti fc Stiavelhll2008l ) 2 . 

So far, our analysis has focused only on the global 
correlation of counts in the survey, showing that faint 
dropouts are preferentially found in fields with bright 
dropouts at a confidence level greater than 99.84% re- 
gardless of contamination and completeness of the selec- 
tion. In the next Section we investigate in details the 
properties of field BoRG58, which contains the highest 
number of dropouts among the BoRG survey fields. 

4. CANDIDATE Z ~ 8 LYMAN-BREAK GALAXIES 
IN FIELD BORG58 

Five objects within field BoRG58 satisfy the selection 
criteria for Y"o98-dropout sources (see Table [2|). In addi- 
tion to the mji2 5 = 25.8 source BoRG58_l 7871420 dis- 
cussed in Til, we identify four additional sources with 
magnitudes rajy^, ~ 27.0. All candidates are located 
within a subregion of diameter d = 70" of the 127" x 135" 
WFC3 field (Figure |3|). At z = 8, this separation cor- 
responds to 3.1 comoving Mpc. All sources are unde- 
tected in Yo98 and consistent with the same redshift of 
z ~ 8 (Figure [T] and Table |2J). The average J125 — i?i60 
color of the four fainter sources (BoRG58_14061418 - 
BoRG58_14550613) appears marginally bluer (A(Ji25 — 
-H160) ~ —0.2) than that of the brightest galaxy 
(BoRG58.17871420). About half of this difference is 
accounted by PSF effects. The bright dropout is de- 
tected over a larger effective radius (r e ff ~ 0".33), 
hence the difference in encircled energy between J125 and 
Hiso is smaller (0.05 versus 0.14 derived for the fainter 
dropouts). The remaining difference after PSF match 
(A(Ji25 — i?i6o) ~ —0.1) could be due to faint z ~ 8 
galaxies having bluer UV slopes (jBouwens "et~aT1l2010b[ ). 

As field BoRG58 appears unique among the survey, we 
performed several tests with the goal (1) of establish- 
ing that the dropouts are real galaxies rather than noise 
peaks and (2) of quantifying the statistical significance 
of the dropout overdensity in the field. 

4.1. Tests to establish reality of the dropouts 

To demonstrate that the detections of the four fainter 
sources BoRG58_14061418 - BoRG58_14550613 are real, 
we performed the following tests. 

Data quality control: We verified that the RMS 
maps around the source locations are regular and do not 
indicate hot/anomalous pixels. 

Negative image test: This test allows us to charac- 
terize non-gaussianity in the tails of the noise distribu- 
tion and therefore to assess the likelihood that detections 
at 5cr confidence are spurious. Using the SExtrac- 
tor segmentation map, we masked sources in our detec- 
tion image (J125), replacing the pixels at these locations 
with pixels sampled randomly from regions of the im- 
age not containing any source. We then inverted the 
image and proceeded to run SExtractor with the same 
parameters used for the actual science image. In the 
negative J125 image, no sources are detected at con- 
fidence level 4.4cr. Hence it is unlikely that the 

2 This effect is also illustrated by our pub- 
lic; cosmic variance calculator available at 

http: //casa. Colorado . edu/~ trenti/CosmicVariance .html 



four dropouts BoRG58_14061418 - BoRG58_14550613 are 
spurious sources as they are detected at ^ 5a in the J125 
science image. 

H-image random shift test: Next, we evaluated the 
likelihood that the low-significance detection (> 2.5cr) in 
Hiqo arises from noise fluctuations, rather than being 
intrinsically associated to flux from sources detected in 
Ji25- This is especially important in light of the minor 
misalignment (0".15) of BoRG58_14550613 in these two 
filters (see Figure [3]) . In this test we essentially replace 
the science image in the H band with a noise-only im- 
age. Therefore, we should expect that the number of 
sources that have flux in both the J125 and Hi 60 bands 
at level sufficient to pass the Yo98-dropout selection is 
much smaller compared to the number of sources found 
when using the H-band science image. For this test, we 
masked sources in the Hi 60 image as described above 
for the negative image test. Next, we randomly shifted 
it along both axes using periodic boundary conditions. 
Then we ran SExtractor and constructed source catalogs 
in all bands, replacing -H160 with the masked and shifted 
copy. We repeated the experiment 100 times with dif- 
ferent shifts, determining an expected contamination of 
Ncontam = 0.03 objects in the Yo98-dropouts catalog (a 
total of three objects among the 100 MC realizations, 
with at most one dropout per realization). The proba- 
bility of detecting four objects when 0.03 are expected is 
p < 10~ 7 . This test thus clearly shows that the four faint 
sources surrounding the brighter BoRG58_l 7871420 have 
correlated flux in the J125 and Hiqq images, strengthen- 
ing the likelihood that they are real. 

Detector-induced noise test: Next, we evaluated 
whether correlated flux in J 12 5 and -H160 arises from de- 
tector noise (e.g., charge persistence). The BoRG survey 
observations have been designed so that exposures in J125 
and Hiqq are preceded in the same HST orbit by a Yo98 
exposure of comparable length; additional time is de- 
voted to F098 imaging in different orbits (Til). Without 
dithering, a source falls on the same detector pixels in 
every IR filter for exposures in any given orbit. We thus 
combined with multidrizzle the subset of Y098 observa- 
tions in the two orbits shared with J125 and H160 (visits 
60 and 62) and measured the Yo98 flux at the location of 
the dropouts. All dropouts have S/N < 1 in this Y098 
sub-image, in agreement with the non-detection using 
the full combination of all Y098 exposures. This demon- 
strates that the detector pixels where the dropout sources 
are located in J125 and Hiqq are not providing anoma- 
lously high electron counts. This test also demonstrates 
that the faint dropouts are not spurious sources induced 
by detector persistence. Otherwise, the Y098 sub-image 
would have been affected more than the J125 one (see Til 
for a more detailed discussion of our optimization of the 
exposure sequence to avoid persistence-induced spurious 
dropouts). Instrumental origin of the correlated flux in 
J125 and Hiqo for the faint dropouts is ruled out. 

4.2. Contamination and completeness 

Improved rejection of contaminants: We stacked 
the images of the five logs-dropouts identified in the 
BoRG58 field to verify that there is no flux in the Vqoq 
and Yo98 bands (see Figured]). Within a circular aperture 
of radius r — 0".32, there is indeed no measured flux in 
these bands, both for the stack of the four fainter sources 
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as well as for the stack of all five objects. Within the same 
aperture, the stacked images of the four fainter sources 
have S/N = 8.1 in J 125 and S/N = 5.0 in H 160 . These 
values are consistent with the ISOMAG S/N measured by 
SExtractor for the individual sources (Table [2]). Stacking 
allows us to set limits that are ^0.75 mag deeper than 
each individual non-detection in Vqoq and Fo98 , ruling out 
to higher confidence the possibility that the majority of 
the dropouts are low-redshift contaminants. 

Dwarf-star contamination: As discussed in Sec- 
tion 12.11 our y"o98 _ dropout selection is highly unlikely 
to include Galactic ultra-cool dwarfs because we require 
Yr,9H — J12R > 1.7, while these stars have Fogs — ^125 < 1-4 
(|Rvan et al.l 1201 it ). Furthermore, such contamination is 
even less likely in field BoRG58 compared to a typical 
field in the BoRG survey. In fact, the BoRG58 line of 
sight has a high Galactic latitude (b ga i — 59°. 0) which is 
in the top 25% of the distribution for \b ga i\ among the 
survey fields considered here. Finally, we measured size 
and ellipticity for the dropouts and compared them to 
the PSF. The profiles of both the bright dropout itself as 
well as of a stack of all faint dropouts show clear signs of 
elongation in both J and H bands (ellipticity 0.25±0.10), 
unlike expected for a star (ellipticity < 0.05). While 
the PSF is undersampled in our images due to the lack 
of dithering, the stacks and the bright dropout are also 
found to be wider than the PSF (FWHM= 0."28±0."02 
for the sources versus FWHM=0."218 for the PSF). 
Therefore, these tests provide further direct evidence 
that contributes to excluding stellar contamination. 

Completeness: We resorted to Monte-Carlo experi- 
ments that recover artificial sources of varying brightness 
to evaluat e the comple t eness of the data, following the 
method of lOesch et al.l (|2007j ). Our code evaluates the 
fraction of input galaxies (modeled as Sersic profiles) that 
are recovered as a function of magnitude given our cat- 
alog construction and color-color selection criteria. We 
used input galaxies with magnitude 24 < TO/125 < 28 and 
a Gaussian UV-slope distribution (/3 = —2.2 ± 0.4). We 
simulated galaxies in the redshift range 6.6 < z < 9.6 
and adopted a log-normal input size distribution with 
mean radius r = 0."175 x 8/(1 + z). We find 50% com- 
pleteness at TOJ125 = 26.88 and > 20% completeness at 
m Ji25 = 27.2, corresponding to the magnitude of our 
faintest candidate (see Fig. 0]). 

4.3. Statistical significance of clustering in BoRG58 

In this section we quantify the statistical significance of 
the clustering of dropouts in field BoR58. First we com- 
pare the overdensity observed against Poisson statistics, 
which has the advantage of providing a measure of the 
clustering significance that is independent of contamina- 
tion and completeness as we discuss in Section [3] Next, 
we include the effect of cosmic variance (galaxy cluster- 
ing) to assess the overdensity significance. 

As discussed in Section [3J field BoRG58 contains the 
highest number of dropouts among the fields searched. 
From a simple counts-in-cell statistics considering the 
full field of view of WFC3, we derive that an over- 
density of 5 dropouts or more under the expectation 
of (N) = 21/23 = 0.913 is realized with probability 
P(N > 5) = 0.0025 under Poisson statistics. As field 
BoRG58 was identified in Til as the one containing the 
best and most robust candidate in the survey, therefore 



establishing an a priori expectation for clustering, we are 
justified in considering the dropout overdensity signifi- 
cant at > 99.75% confidence. If we were to consider 
a more conservative measure and consider the probabil- 
ity of finding one overdensity with N > 5 Yo98 _ dropouts 
among the four fields with a bright candidate from Til, 
we would still obtain that the dropouts are clustered at 
> 99% confidence. 

Furthermore, the dropouts in BoRG58 are located 
within a subregion of the WFC3 field of view; therefore 
the overdensity is even more significant. To correctly 
model the finite field of view and edge effects, we resorted 
to a Monte-Carlo simulation to quantify the probability 
that five sources are located within a region with d = 70" 
within the 127" x 135" WFC3 field if they are distributed 
uniformly with (N) = 0.913 per full WFC3 field. Such 
configuration happens with probability p = 7.2 x 10~ 5 
for field BoRG58, or with probability p = 2.88 x 10~ 4 
under a more robust measure of requiring one overden- 
sity among the four fields with bright dropouts. In both 
cases, the significance of the overdensity is increased by 
a factor ~ 35 compared to the counts-per-field estimate. 

The comparison with Poisson statistics demonstrates, 
to very high confidence level (p > 0.999932), that the 
overdensity of sources in field BoRG58 is physical rather 
than originating from random fluctuations. The dis- 
tribution of galaxy counts in pencil-beam surveys dif- 
fers from Poisson statistics because of clustering (cosmic 
variance) , as measured from the two-point correlation 
function (|PeeblesHl98C[ ). Therefore we take into account 
cosmic variance to evaluate whether the overdensity of 
galaxies in BoRG58 is likely associated to a protocluster 
of galaxies, with the (majority of) dropouts in a narrow 
redshift range. The alternative scenario is a fluctuation 
in the number counts that is made more probable be- 
cause of projected galaxy clustering from sources that 
are at different redshift within the lo98 _ dropout selec- 
tion window. 

As discussed in Section[3j the Fo98-dropout sample has 
the highest bias (that is strongest clustering) in the case 
where the survey is complete and there is no contam- 
ination. We recall that this is because an incomplete 
survey samples lower halo masses and because contam- 
inants and z ~ 8 dropouts have no cross-correlation. 
If we assume a mean number density of (N) = 0.913 
z ~ 8 candidates per WFC3 field, we derive from our 
cosmic variance calculat or a bias 3 b = 8.9, th at is a 
cosmic variance of 48% (jTrenti fc StiavellH l2008h . This 
estimate relies on an abundance match of high-redshift 
galaxies with dark-matter halos, whic h is the working as- 
sump t ion adopted by several s tudies (iNewman fc Davis! 
[200a ISomerville et al.l 12001 iTrenti fc Stiavellil 120081: 
iMunoz 1 et al.ll2010l: lRobertsonll2010D . This modeling has 
been tested observationally at z < 6 by determination of 
the two-point correlation function of Lym an-break galax- 
ies (|Overzier et all2006bt Eee et al. 2009), and indirectly 
at 7 < z < 10 because conditional luminosity func- 
tion models correctly predict the observed abundance 

3 Note that the bias of the population of contaminants is sig- 
nificantly lower than the bias of z ~ 8 sources with the same 
luminos i ty. F rom the conditional luminosity function model of 
ICooravl 1)20051 ). a contaminant at z ~ 2 with observed magnitude 
m Jl25 ~ 26.5 (Mji2S ~ —18.3) is expected to live in a dark-matter 
halo of mass Mh a ; PS 10 11 Mq with bias b i=a 1.5. 
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of dropo ut galaxies in the HUDF and ERS/C ANDELS 
surveys (jTrenti et all 120101 : lOesch et all 120 11( 1. In any 
case, the uncertainty in the bias of the Y-dropout sam- 
ple propagates only mildly into the final determination of 
the number counts fluctuations. This is because cosmic 
variance and Poisson dispersion are summed in quadra- 
ture; therefore, when the expected number of objects 
per field is smaller or comparable to unity, Poisson un- 
certainty dominates the total dispersion. In fact, if we 
assume a pencil beam of d = 70" with redshift selec- 
tion 7.5 < z < 8.5 (Til), we derive a variance of 0.53 
Fo98-dropouts per cell of diameter d = 70" around their 
average value of 0.20. That means that the overdensity 
of five logs-dropouts in BoRG58 is significant at 8.9cr, 
after including the effects of galaxy clustering. For refer- 
ence, under Poisson statistics the overdensity would have 
been significant at the lO.lcr level. 

An approximate analytical form for the number counts 
distribution in presence of cosmic variance could be 
derived analytically under the same assumption that 
light traces dar k matter (e.g., lAdelberger et al.l 119981 : 
iRobertsonl I2010D . but that would not allow us to dis- 
tinguish excess number counts originating from physi- 
cal overdensities versus line-of-sight random superposi- 
tions. Therefore, we traced a pencil beam trough a cos- 
mological simulat i on, fo llowing the method described in 
ITrenti fc Stiavellil (|2008l) to construct the full probabil- 
ity distribution of the expected number counts (see also 
iMunoz et al.l 120101 We used the box of edge hox = 
100 Mpc h' 1 (h = 0.7) simulated with N v = 5 12 3 DM 
particles as discussed in lTrenti fe Stia vclli (2008). Snap- 
shots and halo catalogs were constructed every Az = 
0.125, allowing us to model the redshift evolution of the 
box over multiple snapshots as the lo98 _ dropout pencil 
beam is traced (Az = 1 centered at z = 8). We set a min- 
imum threshold of 92 particles for the dark-matter halos 
to be included in the pencil beam, as this gives an aver- 
age number of counts in a WFC3-like pencil-beam that is 
equal to the observed value of (Nydropout) — 0.913 (every 
dark-matter halo above the threshold is populated with a 
galaxy). We then considered a pencil beam equivalent in 
volume to the d — 70" region that contains the BoRG58 
overdensity, but shaped as a parallelepiped for compu- 
tational reasons: 2.83 x 2.83 x 311 Mpc 3 (corresponding 
to 62" x 62"xAz, with Az = 1.0 at (z) = 8.0). We ran 
4000 random realizations of the pencil beam, obtaining 
(N) = 0.2 and variance 0.53, in agreement with the ana- 
lytical estimate discussed above. The full number count 
distribution of the logs-dropout counts is presented in 
Figure |6] Five realizations in 4000 have N > 5. Only 
one of these five fields with high number counts arises 
because of superposition of dropouts at different redshift 
along the pencil beam. The remaining realizations have 
all dropouts within a region of comoving size 5 Mpc. 
This numerical experiment allows us to set a probability 
p = 1 - 1/4000 = 0.99975 that the BoRG58 overdensity 
is a not a result of galaxy clustering along the line of 
sight. In addition, in the case of random superposition 
along the line of sight, all dark-matter halos have compa- 
rable masses near the low-mass cutoff to be included in 
the beam. In contrast, the fields that include 3D clusters 
have at least one halo with mass about four times higher 
than the minimum mass. Under the assumption that 
luminosity correlates with mass, the presence of a very 



luminous galaxy as in the case of field BoRG58 further 
strengthens the likelihood of a proto-cluster scenario. 

Finally, we note that the overdensity of Fogs-dropout 
sources observed in BoRG58 is also more significant 
than any overdensity of z ~ 8 sources found among 
the 10 WFC3 pointin g in the GOODS/ERS fields by 
iBouwens et all (|2010af) who found two z ~ 8 candidates 
in a region of diameter d = 74". 

To summarize, all tests performed imply that the logs- 
dropout overdensity in BoRG58 is real and has high sta- 
tistical significance. To investigate in more detail a possi- 
ble scenario for the physical origin of the overdensity, we 
resort to numerical modeling of galaxy assembly during 
the epoch of reionization in the next Section. 

5. THEORETICAL/NUMERICAL MODE1ING OF 
THE OVERDENSITY 

Under the assumption that the five lo98 _ dropout of 
field BoRG58 are at z ~ 8, we can infer the dark-matter 
properties of the halos in which they reside by resort- 
ing to t he Improved Condi tional Luminosity Function 
model of ITrenti et all ((2010) . From their halo-mass ver- 
sus galaxy-luminosity, we infer that the brightest mem- 
ber of the overdensity (Mab = —21.3) lives in a Mh ~ 
(7 ± 2) x 10 11 M Q halo. Such halo is a 5cr density peak 
with comoving space density « 9 x 10 _ 7 Mpc -3 , which 
has approximately a 20% chance of being present within 
the volume probed by the BoRG survey (2.3 x 10 5 Mpc 3 ). 
This mild tension could be eased if there is some redshift 
evolution of the halo-mass versus galaxy-luminosity re- 
lation so that galaxies at highe r z become brighter a t 
fixe d halo mass as su ggested bv lMunoz fc LoebT(|2011h : 
the ITrenti et al.|[2010l model is calibrated at z = 6 while 
the Y-dropouts are at z ~ 8. If this is the case, we can 
derive instead the halo mass from number-density match- 
ing, obtaining a slightly lower value (Mh ~ 4x 10 11 Mq). 
The fainter galaxies (Mab ~ —20.0) are expected to 
be hosted in lower mass, and more common, halos with 
M h « 10 11 M (4cr peaks). 

Using exte n ded Press-Schechter modeling 
(jMunoz fc Loebl l2008ah . we expect on average 
N = 4.8 halos with M h > 10 11 M surround- 
ing BoRG58_17871420 within a sphere of diameter 
d = 3.1 Mpc (comoving) corresponding to d < 70". For 
reference, the average number of halos above this mass 
threshold for a random region of the universe of the same 
volume at z = 8 is (N) < 10~ 3 . A massive dark matter 
halo like the one expected for BoRG58_l 7871420 boosts 
by more than three orders of magnitude the number 
density of halos that can host galaxies sufficiently 
luminous to be detected. Similar results are given by 
a more powerful clustering model origi nally developed 
to des cribe the Shapley Supercluster (|Mufioz fc Loebl 
2008b). This analytical estimate shows that clustering 
of fainter dropouts around a very bright dropout is 
naturally expected. The prediction can be extended 
further to argue that if no clustering is observed at all, 
then it is likely that the very bright dropout is not at 
z ~ 8, but rather a foreground contaminant, because 
in this case the source is within a d ark-matter halo 
with lo wer bias (see Section 14.31 and also [Munoz fc Loebl 
l2008all We note that, as discussed in Section [3l all the 
bright dropouts in the BoRG survey show evidence of 
clustering. The signal is not as strong as in the case of 
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BoRG58, but is still significant at > 99.84% confidence. 

Analytical predictions of clustering are limited to 
spherical volumes, whereas o bservations select galax- 
ies within a pen cil beam (Trent i fc Sti avelH 120081 : 
iMufioz et all |2010T >. For more accuracy, we use a 
set of 10 cosmological simulations designed to study 
high-z galaxy formation in overdense environments 
(|Romano-Diaz et al.ll201lT) . These runs follow only the 
evolution of dark matter and similar to the cosmic vari- 
ance estimates constructed in this paper rely on the 
assumption that light traces matter 4 . The simulations 
have a comoving volume (28.6 Mpc) 3 and mass resolu- 
tion m p = 3 x 10 8 M Q . Halos with 10 11 Mq are thus 
well re solved with several hundred particles (|Trenti et al.l 
l2010al) . The initial conditions were constrained to con- 
tain a Mh ~ 10 12 M halo at z = 6, imposing the 
input overdensity at th e center of the box with the 
Hoff man fe Ribakl (|1991h method. These simulations 
have a central dark-matter halo with {Mh{z = 8.08)) ~ 
5 x 10 11 Mq at z — 8 and are therefore appropriate 
for investigating the environment of BoRG58_l 7871420. 
Within a 70" x 70" field of view with a line-of-sight ex- 
tension of 19 Mpc (Az fx 0.05), there are on average 
N ~ 6.4 halos with M h ^ 10 11 M©. We find N = 10 
in the realization with the highest abundance of halos 
(Figure 0. For comparison, we expect N = 0.013 halos 
in such volume for a random reg ion of the univ e rse at 
z = 8. Numerical simulations by lOverzier et all (2009) 
show that projected overdensities of dropouts might not 
be associated with actual 3-D overdensities, but the prob- 
ability is negligible for the field of view considered here, 
as we demonstrated from our analysis of numerical sim- 
ulations in Section 1431 fp = 2.5 x 10" 4 ). 

These theoretical expectations indicate that, if the 
bright z ~ 8 candidate in field BoRG58 lives in a mas- 
sive dark-matter halo, then the overdensity of fainter 
Yo98-dropouts in its proximity is naturally explained as a 
proto-cluster structure. Although without spectroscopic 
confirmation we cannot demonstrate that the overdensity 
is composed of z ~ 8 galaxies at a common redshift, all 
other hypotheses (either contamination from foreground 
galaxies or Galactic stars, or 7.5 < z < 8.5 galaxies 
at different redshifts) are highly unlikely (Section I4.3[) . 
Of course, the current data cannot exclude that some 
of the faint dropouts are at different redshifts with re- 
spect to the bright dropout (including being foreground 
contaminants). Our numerical simulations indicate that 
deeper data would not only confirm the nature of the 
currently identified Yo98-dropouts, but more importantly 
they would discover additional, fainter candidates living 
in the proximity of the massive dark matter halo that 
hosts BoRG58_17871420. 

Assuming that the BoRG58 overdensity is composed of 
galaxies at the same redshift, our modeling gives us in- 
sight on the fate of the overdensity as it evolves with 

4 In passing, we note that the conclusion offRomano-Dia z et al.l 
1120111) was that the predicted clustering of Lyman-break galaxies 
around the brightest z ~ 6 QSOs was stronger than observed, un- 
der the assumption that all these QSOs live in dark matter halos 
with masses inferred from abundance matching (Mh > 10 12 Mq). 
Therefore it was proposed there that the tension with the observa- 
tions could be eased if so me of the QSOs are i nstead hosted in less 
massive halos. Recently, Volontcri & Stark (2011) showed that, 
because of the SDSS magnitude-limited selection, this is likely to 
be the case. 



time. If the five galaxies are within a sphere of ra- 
dius r = 1.5 comoving Mpc, then the region is already 
non-linear, with an average overdensity Ap/ (p) > 10 
by counting just the mass within the galaxy host ha- 
los. From our cosmological simulations, we derive that 
these galaxies are likely gravitationally bound and will 
collapse by z ~ 3 into a single dar k-matter halo (see 
Figure 7 in lRomano-Diaz et al.|[20TlT ) . This is confirmed 
by extended Press-Schechter modeling, which takes into 
account the effects of halo growth and clustering, allow- 
ing us to derive the probability p(Mh > M2, z%\M\, z\) 
that a dark-matter halo of mass Mh = Mi at redshift z\ 
will evolv e into a halo with m ass Mh > Mi at redshift 
z 2 < Z\ (jLacev fc Cold I1993D . We used the extend ed 
Press-Schechter code developed in lTrenti et al.l ()2008l ) to 
derive the expected mass distribution for the descendants 
of a BoRG58-like halo (mass M h {z = 8) = 7 x 10 7 M ) at 
lower redshift, as shown in right panel of Figure [7] The 
figure predicts that the BoRG58 overdensity will evolve 
into one of the first massive galaxy clusters formed in the 
universe, with M h > 2 x 10 14 M by z = 0. 

6. SUMMARY AND CONCLUSIONS 

Assuming a correlation between galaxy luminosity and 
dark-matter halo mass (|Vale fc Ostrikerl [200l iCooravl 
I2005t iTrenti et al.ll2010T) . a natural prediction of dark- 
matter clustering is that the brightest galaxies at z > 6 
should be surrounded by an overdensity of fa inter galax- 
ies at similar redshift (jMunoz jfcLocb 2008a). In this pa- 
per we tested this scenario using multi-band HST data 
from the BoRG survey (Til). Initial results from the 
survey identified four bright z ~ 8 candidates as lo98- 
dropout sources from a large area medium-deep survey in 
four bands (V, Y, J, H), covering 130 arcmin 2 to a median 
5c sensitivity of tuab ~ 27. As the four bright candi- 
dates in Til are detected at high confidence (S/N > 8), 
in this paper we searched for fainter dropouts (detected 
at 5 < S/N < 8) with the goal of testing whether the 
faint sources are found preferentially in the fields contain- 
ing bright candidates. Our extended search (see Section[3] 
and Table [lj identified 17 additional dropouts within 23 
independent fields. Eight of the new sources are located 
within the four fields that contain a bright Yogs-dropout 
identified by Til. Each of these fields contains at least 
one faint dropout. By means of Monte-Carlo simulations 
we established that there is a random chance p = 0.0016 
that this occurs if no correlation between bright and faint 
dropouts is present. Therefore, the theoretical expecta- 
tion that such correlation is present is verified from the 
BoRG survey data at 99.84% confidence. This result 
holds independent of the contamination rate and com- 
pleteness of the Yo98 _ dropout selection. Furthermore, 
contamination and incompleteness reduce the strength 
of such correlation (Section [3]). Therefore, our result is 
robust against these effects. The presence of the corre- 
lation suggests that the samples of both bright and faint 
dropouts are not severely affected by contamination, as 
this would weaken the strength of the cross-correlation 
signal. 

Interestingly, field BoRG58, in which Til identified 
the best z ~ 8 candidate of the BoRG survey based on 
the photometric data, shows the strongest overdensity of 
faint Yo98 _ dropouts: four additional sources are identified 
in this paper. The other three fields containing a bright 



Galaxy Clustering at z ~ 8 from the BoRG Survey 



9 



candidate from Til still show an excess of fainter sources 
compared to the typical BoRG survey number counts, 
but their overdensities are more modest (one or two faint 
dropouts per field). There can be several contributing 
factors that lead to the difference between BoRG58 and 
the other fields with a bright candidate. First, the bright 
dropout in field BoRG58 is the one with the highest 
signal-to-noise detection {SjNjxib = 13.2). Therefore, 
this gives the broadest dynamic range among the fields 
with bright candidates to search for fainter companions 
with S/Nji25 > 5. Another possibility is that this is 
due to scatter in the halo-mass versus galaxy luminosity 
relation, typic ally modeled as a log-normal distribution 
(Cooray 2005). In this case, the bright dropout in field 
BoRG58 might live in a halo with (above) average mass, 
while the other bright candidates might be over-luminous 
for their halo mass (and hence not clustered as strongly) . 
This scenario could be favored because of the observa- 
tional bias toward outliers present in a magnitude-limited 
sample, in analogy to what has been di scussed in the con- 
text o f the z ~ 6 QSOs luminosities bv lVolonteri fe Starkl 
(|2011|) . Finally, the other very bright Fo98-dropout of the 
sample, a vtiab = 25.5 galaxy in field BoRGlk, may be a 
contaminant, as discussed in Section 5.2 of Til. In this 
respect, it is interesting to note that the next best z ^ 8 
candidate identified by Til is in field BoRGOt, which 
contains the second most significant overdensity of faint 
Yo9sm~ dropouts, with two additional sources detected 
in addition to the tuab = 26.7 F098 galaxy discussed in 
Til. This galaxy is almost one magnitude fainter than 
the one in BoRG58, so it is not surprising to find fewer 
members in the overdensity. 

Given the special interest of field BoRG58, we carried 
out extensive analysis and data interpretation. In Sec- 
tion m we quantified by means of extensive testing that 
the overdensity of five dropouts in field BoRG58 is sig- 
nificant at 99.97% — 99.999% confidence under Poisson 
statistics, with the range depending on a priori assump- 
tions. As in the case of the global analysis for correlation 
among counts in the survey, this result is independent of 
contamination and completeness. We also quantified the 
impact of galaxy clustering in the determination of the 
statistical significance of the overdensity. Under the most 
conservative scenario of a complete survey with no con- 
tamination (this choice maximizes the bias of the Yog$- 
dropout population), we derive that the overdensity of 
dropouts in BoRG58 lies 8.9<r away from the average 
number count. To quantify the exact probability associ- 
ated with this we constructed the number count distri- 
bution by tracing a pencil beam through a cosmolo gical 
simulation as described in iTrenti fc Stiavellil ()2008l ) . We 
derive that five counts as in the BoRG58 overdensity are 
associated with a clustered structure at a common red- 
shift with probability p > 99.975%. 

These results strengthen our confidence that the bright 
logs-dropout BoRG58.17871420 is indeed at z ~ 8 and 
lives in a dark-matter halo of mass Mh ~ (4 — 7) x 
10 11 M Q (5fi density peak), rather than being a lower 
rcdshift contaminant. If that galaxy were at z ~ 1.5 — 2, 
it would be a low-luminosity galaxy hosted in a common 
< 2cr peak which would n ot be surrounded by a n over- 
density of similar sources (IMunoz fc Loebll2008ah . Fur- 
ther evidence for the z ~ 8 nature of the source is give n 
by Keck-NIRSPEC spectroscopy ([Schenker et al.l 12011( 1 



While no Lya emission line has been detected, to a me- 
dian equivalent width of EW > 40 A, this spectroscopic 
follow-up rules out that the galaxy is a foreground con- 
taminant at z r~j 2 with flux in the J and H bands pri- 
marily contributed by strong emission lines, which would 
have been detected in the spectrum at very high confi- 
dence (> 20ct). 

The faint Yo98-dropouts in BoRG58 are however at the 
detection limit of the current data and likely affected 
by significant contamination and incompleteness (Sec- 
tion [42]). To confirm beyond reasonable doubt the ex- 
istence of a physical 3D overdensity, and to explore in 
more detail its properties, new observations are needed. 
Deeper HST data, both in the optical and in the near- 
IR would improve the rejection of contaminants and 
likely discover new members of the overdensity, as ex- 
pected from our theoretical/numerical model of the over- 
density (Section [5]). In this respect, we note that our 
catalog contains several sources in the field that are 
candidate lo98-dropouts but are only detected in the 
J band at 4 < S/Nji2b < 5 confidence. In addition, 
Spitzer/IRAC observations (3.6 /mi and 4.5 /tm) would 
allow us to exclude to higher confidence dusty and pas- 
sively evolving z ~ 1.5 contaminants, and more impor- 
tantly to derive the age and stellar mass for the brightest 
dropout, BoRG58_17871420. These observations could 
rule out that this ultra-bright z ~ 8 candidate is a young 
(t < 5 Myr) and not very massive starburst, a scenario 
which we did not consider in our theoretical modeling of 
the overdensity. In fact, t ypical z ~ 8 sourc es have ages 
of t ~ 100 - 300 Myr (see lLabbe et al.ll2010h . 

Finally, it will be possible to further investigate the na- 
ture of the BoRG58 overdensity with the next generation 
of ground-base d, multi-object spectro graphs. The Keck 
observations of ISchenker et al.l ([20111 ) are only moder- 
ately deep, with five hours of integration time split in two 
different filters to cover the exp ected wavelength ra nge 
of Lya emission (as a reference, iLehnert et al.ll20"Tol ob- 
served with the VLT SINFONI spectrograph a z ~ 8 
galaxy for about 15 hours in a single filter). Galax- 
ies at z ~ 8 are likely to have smaller Lya equivalent 
width compared to lower redshift objects. Theoretical 
predictions sugge st that Lya emission sh ould be present 
with EW > 15 A ([Daval fc Ferrarall2011h and that some 
Lya flux should be transmitted even in case the IGM is 
neutral beca use of radiative transfer in duced by galac- 
tic outflows ([Dijkstra fc Wvithd \20Wt ). This limit is 
within reach of a deeper spectroscopic follow-up from the 
ground and especially with a multi-object spectrograph 
such MOSFIRE at Keck, which could provide the defini- 
tive evidence that we are witnessing the assembly of one 
of the first galaxy clusters in the Universe. 
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Fig. 1. — Left panel: F098M-dropouts color-color selection. Black squares indicate the five z m 8 candidates (with lcr error-bars on 
colors). Cyan lines denote the selection window. Blue dots are simulated low-redshift interlopers; red triangles are z > 7.5 galaxies. L/T 
dwarf stars would appear in green shaded area. Right panel: photometric redshift distribution derived from Monte-Carlo source-recovery 
simulations (top: BoRG58_17871420; bottom: fainter sources). 



TABLE 1 

Y098-DROPOUT (z ~ 8 CANDIDATES) IN THE BoRG 
SURVEY. 



Field 


™Vl25 


S/N J125 


R.A. 


Decl. 


BoRG70 


26.4 


5.2 


157.7291 


+38.0474 


B0RG66 


26.2 


8.7 


137.2732 


-0.0297 


B0RG66 


26.5 


7.0 


137.2879 


-0.0338 


BoRG58 


25.8 


13.0 


219.2107 


+50.7260 


BoRG58 


27.2 


5.1 


219.2241 


+50.7260 


BoRG58 


26.9 


5.5 


219.2311 


+50.7241 


BoRG58 


27.2 


5.4 


219.2203 


+50.7156 


BoRG58 


27.0 


6.0 


219.2224 


+50.7081 


BoRG2t 


26.6 


6.8 


95.9036 


-64.5480 


BoRGlv 


26.4 


5.2 


187.4776 


+7.8286 


BoRG lk 


25.5 


11.4 


247.8968 


+37.6039 


BoRG lk 


26.9 


6.1 


247.8981 


+37.6048 


BoRGOy 


27.0 


6.5 


177.9196 


+54.6847 


BoRGOy 


26.6 


7.9 


177.9726 


+54.6995 


BoRGOy 


27.2 


6.3 


177.9751 


+54.6979 


BoRGOj 


26.9 


5.1 


178.1887 


+0.9340 


BoRGOc 


26.7 


6.6 


118.9794 


+30.7178 


BoRGOg 


26.5 


6.1 


124.8104 


+49.1775 


BoRGOt 


26.7 


8.6 


117.7142 


+29.2715 


BoRGOt 


27.1 


6.5 


117.7064 


+29.2977 


BoRGOt 


27.0 


5.9 


117.6965 


+29.2851 



Note. — First column: survey field ID. Second col- 
umn: total magnitude in the J band, including aper- 
ture correction (automag). Third column: detection 
S/N in the J band (isophotal measurement). Fourth 
and fifth columns: Yogs-dropout coordinates (Degrees, 
J2000 system). 
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Fig. 2. — Number counts distribution of the Y-band dropouts within the 23 BoRG fields considered in this paper (solid histogram), 
lines show a Poisson distribution with the same mean ({N) = 0.913 per field). 
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TABLE 2 

Y"o98-DROPOUT (z ~ 8) SOURCES IN FIELD BORG58. 



ID 


Magtot 


Position 




Photometry [isomaj 






Jl25 [automag] 


R.A. 


Decl. 


Veo6 


Vo98 


Jl25 


#160 


BoRG58_17871420 


25.8 ±0.1 


219.2107 


+50.7260 


> 28.6 


> 28.8 


25.9 [13.0a] 


26.0 [8. Oct] 


BoRG58_14061418 


27.2 ±0.3 


219.2241 


±50.7260 


> 29.0 


> 29.4 


27.5 [5.1a] 


27.8 [2.6<r] 


BoRG58_12071332 


26.9 ±0.2 


219.2311 


±50.7241 


> 28.3 


> 29.3 


27.3 5.5<t] 


27.6 [2.7a] 


BoRG58_15140953 


27.2 ±0.2 


219.2203 


±50.7156 


> 28.5 


> 29.2 


27.4 [5.4a] 


27.7 [2.7a] 


BoRG58_14550613 


27.0 ±0.2 


219.2224 


±50.7081 


> 29.0 


> 29.3 


27.2 [6.2<r] 


27.6 [2.9cr] 



Note. — First column: source ID. Second column: total magnitude (J125, including aperture 
correction (automag). Third/fourth columns: R.A. /Decl. (Deg), J2000 system. Final columns: 
source photometry within detection pixels (isomag), with detection significance within brackets, or 
la upper limits. 
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Fig. 3. — Left panel: J125 image of field BoRG58, with Yogs-dropouts indicated by blue circles. Right panel: postage-stamp images 
(3". 2 x 3". 2) of sources: BoRG58.17871420, BoRG58.14061418, BoRG58.12071332, BoRG58.15140953, and BoRG58.14550613 (top to 
bottom). 




Fig. 4. — Completeness as a function of recovered mji25 for the Yo98-dropout selection in field BoRG58. T he data are 50% complete at 
mj = 26.88, estimated using Monte-Carlo experiments of source recovery as described in Ocsch ct al. (2007). 
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Fig. 5. — Centered-stacked postage-stamp images (3". 2 X 3". 2) for Y"o98-dropouts i n BoRG58 (top: four fainter sources. Bottom: all 
sources). Stacks have no flux in V606 and Y"o98, as expected for z ~ 8 sources: S/Nyg g = 0.9, S/Nyggg = 0.6, S/Ny^^ = 1.2, 

s/<? 9 ; nt) = i.25. 
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Fig. 6. — Number count distribution of yo98-dropouts in a pencil beam of comoving volume 2.83 X 2.83 X 311 Mpc 3 (corresponding to 
62" X 62"xAz, with Az = 1.0 at (z) = 8.0), traced through a cosmological simulation with box edge li, ox = 143 Mpc (comoving) and 
N v = 512 3 DM particles (Trenti & Stiavelli 2008). Only 5 out of 4000 random realizations of the pencil beam contain an overdensity of at 



least 5 sources. In four of the five cases the overdensity is associated with a 3D proto-cluster. A random superposition of logs-dropouts at 
different redshifts along the pencil beam happens with probability p = 1/4000 = 2.5 X 10~ 4 , thereby suggesting a physical origin for the 
BoRG58 overdensity to a high degree of confidence. 
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Fig. 7. — Left: Dark-matter halo distribution at z = 8.08 for simulated proto-cluster (comoving volume 11 X 11 X 19 Mpc 3 ; from 
I Roman o-Diaz ct al. 2 0111 ). > 10 11 halos (red and blue circles) host galaxies likely detectable in the BoRG58 field at the current 
depth (the blue circle denotes the most massive halo in the simulation). Green circles are less massive halos (10 10 Mq < < 10 11 M©). 
Right: BoRG58 overdensity evolution predicted from extended Press-Schec hter modeling. Bla ck line: median of probability that a z = 8 
halo with = 7 X 10 Mq evolves into a halo of mass M 2 at 22 (from Prrenti et alj|2008h . Blue/red lines are la and 2<r confidence 
contours. 



